Quantum information is incompressible without errors 
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A classical random variable can be faithfully compressed into a sequence of bits with its expected 
length lies within one bit of Shannon entropy. We generalize this variable-length and faithful scenario 
to the general quantum source producing mixed states pi with probability pi. In contrast to the 
classical case, the optimal compression rate in the limit of large block length differs from the one in 
the fixed-length and asymptotically faithful scenario. The amount of this gap is interpreted as the 
genuinely quantum part being incompressible in the former scenario. 
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One of the fundamental questions in the information theory is about the data compression, namely, what is the 
r shortest description of a data. This question is important not only for quantification of the amount of information, 
' but also for understanding how well we can manipulate information stored in physical systems, which is a central 
, ^ topic in the field of quantum information. In the case of compressing a classical random variable X which takes 
letter x with probability p{x), the answer is given by the quantity called the Shannon entropy, which is defined as 
H{X) = — J2xPi-'^)^'^S2Pi^) when measured in bits Q. More precisely, a compression scheme is regarded as an 
^ I ' assignment of each letter x with a codeword of length l{x) bits, which is not fixed and dependent on x. Under the 
requirement that the codewords are uniquely decodable (faithful) even when they are concatenated to describe a 
■ sequence of letters, the minimum Lmin of the expected length L = J2xPi^)K^) satisfies ^ H{X) < Lmin < H{X) + 1. 
^ ] When a sequence of n letters independently drawn according to p(x) is collectively compressed, the minimum expected 
lO ■ length lies between nH{X) and nH{X) + 1 because of the additivity of the entropy function. Hence the optimum 
compression rate -R^pt for block length n, defined as the minimum expected length per letter, satisfies 

O : H{X) < Rill < H{X) + 1/n, (1) 

(N ■ 

O ; and thus Ri°^^ = hm„_oo = HiX). 

In addition to this 'variable-length and faithful' (VLF) scenario, we encounter the Shannon entropy also in a slightly 

Qh. different scenario of compression, in which all codewords have a common fixed length, and small errors in decoding 
^ ■ are allowed as long as they vanish in the limit of large block-length. In this 'fixed-length and asymptotically faithful' 

^ (FLAF) scenario, we consider a sequence of compression schemes labeled by the block length n, which have code length 

^ L^"-* and error probability perr • The sequence is asymptotically faithful when lim„^ooPc"r = 0. The asymptotic rate 
of compression for the sequence is characterized by i? = lim„_»ooi'-"V'^- The optimal compression rate R^^^ in this 
ILj^ ' scenario is defined as the infimum of R among all asymptotically faithful sequences. Requirement for this scenario 
is weaker than the VLF scenario in spite of the constraint on the codeword length. This is seen by composing a 
FLAF sequence with asymptotic rate i?opt + ^ by just repeating m times a VLF scheme with rate i?opt and treat the 

c3 ' concatenated codewords longer than mn{Rl^^ -\- S) as errors. With n fixed and in the limit of m — > oo, this sequence 
is asymptotically faithful for any 6 > due to the law of large numbers. Hence we have 



>< 



^opt — ^opti \/^) 

and i?opt'' — ^opt- the case of compressing a classical random variable X, it is known that R^p^ is also equal to the 

Shannon entropy, namely, -R^^' = R^pt = H{X) 

In the quantum case of compressing an ensemble £ — {pi,pi}, we consider a source that emits a quantum system 
in state (letter) pi with probability pi, and n systems drawn from this source are compressed into qubits. The 
discussions were first focused on the FLAF scenario, and the result when pi are all pure states |J| is R^I = Sip), 
where S{p) = — Tr[plog2/5] is the von Neumann entropy of the average state p = 'Y^^PiPi- When {pi] includes 
mixed states, R^^t is still given by the von Neumann entropy after removing hidden redundancy This striking 
similarity to the classical case implies that there is no big difference in the compressibility between classical and 
quantum information, at least in the FLAF scenario. Recently, the investigations on different scenarios §-|l| have 
been started to reveal how the optimal rates vary depending on small differences in the constraints. A notable example 
is the information defect, the difference between the rate R^^t defined above (which is referred to as the blind scenario 
in this context) and the rate in an easier scenario (the visible scenario) in which the identity of the index i is available 
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in the compression stage. The information defect has turned out to be nonzero even in the classical cases where 

all {pi} commute, implying that the origin of this gap does not necessarily lie in the nature of quantum information. 
In this Letter, we discuss the VLF scenario for general quantum ensemble £ = {Pi,Pi}, and derive inequalities 

corresponding to Eq. (P, which identify R^^^ . We show that the gap Af-af = R'^t ~ ^opt is generally nonzero, 
and nonzero only if {pi} do not commute. When we further separate the information in £ into the classical part and 
the quantum part, the origin of the gap becomes transparent, namely, the genuinely quantum part of the information 
turns out to be incompressible in the VLF scenario. 

The keystone in our derivation of i?opt'' is the observation that the length of the qubits used in a single round of 
the compression operation can be regarded as an outcome of a measurement on the system to be compressed. This 
is justified in an operational sense as follows. Suppose that initially there is a resource of A'^ qubits, and an input 
state from the source £ = {pi,Pi}(j>i > 0) acting on Hilbert space are compressed into a number of qubits via a 
VLF scheme. The notion that "L qubits has been used to compress the input state" means that there should remain 
N — L qubits in the resource which can be utilized in other independent tasks. This means that after the state is 
decompressed back in 7i^, the value L can be determined without accessing 7i^. In other words, if we generally 
write the whole quantum operation of the compression and decompression as a unitary operation U on the combined 
system of Ti.'^ and an auxiliary system TYe initially prepared in a standard pure state Se, we should be able to define 
an observable L acting on He, which corresponds to the length of qubits used to compress. The expected length (L) 
of the compression scheme U applied to the source £ is then written as 

{L) = Tte{LTta[U{p tE)U^]}, (3) 

where p = '^^PiPi- At the same time, since the compression scheme is faithful (no errors), U obeys 

TYE[U{p^®tE)U^]}^P^. (4) 

The length of qubits is thus regarded as the outcome of a generalized measurement on that introduces no 
disturbance on the initial states {pi}. 

The property of the operations that preserves the initial states {pi\ was analyzed in detail recently | p2[ , and it was 
shown that, given {pi}, we can find a unique decomposition of TYa, the support of p, written as 

WA = 0^f®<. (5) 

I 

Under this decomposition, pi is written as 



Pt = 



0p(-')p(-')0p({\ (6) 
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where Pj*'''' and p^ are normalized density operators acting on Ti^p and 7i|{\ respectively, and is the probability 
for the state to be in the subspace ® T^k ■ Pk is independent of i, and {pj^'^\ p'f'''\ ■ ■ ■} cannot be expressed in 
a simultaneously block-diagonalized form. Any U that satisfies Eq. (|^) is then expressed in the following form 

;7(iA®SE) = 0if ^l^^eIIk ®^e), (7) 
I 

where t/^g are unitary operators acting on the combined space Ti.'^ ®'He, satisfying TrE[C/4'E(/°K^ ®^'e)U^^]} ~ p^ ■ 
An explicit procedure to obtain this particular decomposition is also given in . 
The total density operator p = ^^PiPi for the ensemble £ is also decomposed as 

P-®P^'^pT®pI\ (8) 
I 

where p^'^ = J^iPiP'^'^''^ pT = (SiP«p'*''^/°7*''')/p''^- Now, let us take a basis {|aj)j'^} (j = l,...,dim Tij') 
for each , and consider another source £' — {qx,<7\} with double index A = which is defined through the 

decomposition ^ such that q\ = (dim ?ij and 

^x = \a,)TTi^j\®PK- (9) 
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The total density operator u = 'Ylixly^'^y^ f^'' this source is 

^^0p(O(diniH^'V'ij^®p|^^- (10) 

Suppose that this source is used as an input to the same compression scheme. The form assures that the operation 
[/ also preserves a\, namely, the scheme IJ works as a VLF scheme for Z' . We further note that, since the difference 
between p and a lies in the internal states of Tij , and the form ensures that the state of the auxiliary system 
(TYe) after the operation IJ is insensitive to them, we obtain TrA[C/(/5® Se)^^^] = TrA[t^((5''X>SE)C^^]- Then, according 
to Eq. (H), the same expected length (L) should be observed even when the source is replaced with E' . Hence we have 

imin(£) > imi„(f')- (H) 

In order to find a lower bound of Lmin(i?'), we use Eq. (|^), which also holds for general quantum ensembles. 
Since the letter states a\ of the ensemble £' are all orthogonal, the optimum rate should be equal to 

that of an orthogonal pure-state ensemble {^a, |A)}, and hence R^^^{E') — — X^a '?->^ ^°S2 9a- Using this with q\ — 
(dim H^'V V\ Eq. (|) with n = 1, and Lmi„(f ') = R^'k^') (by definition), we have 

in.in(f') > RoU^') = W) + ^NC(^), (12) 

where Ic(£) = P^'^ log2 v'^^ and Dnc (■?) = Ez P^'^ log2 dim uf . 

For an upper bound for L^yJ^E)^ we consider a specific example of the VLF scheme as follows. The compressor first 
makes a projection measurement to determine the index /, which appears in the decomposition (^). The projected 

state should be supported by subspace Hj ■* ® vl^l . Since the outcome I occurs with probability p^'^ , there exists an 
instantaneous code for encoding I with an expected length not larger than ]i{\jP^Y) + 1 bits The compressor 
write down this classical codeword onto qubits using the standard basis {|0}, It then discards 'Hk\ and transfer 

the state of TYj^ (which should be Pj*''^) into [log2 dim Hj^] qubits and concatenate them after the qubits holding 
the classical codeword for I. The expected total length of the qubits is then not larger than Ic{£) + 1 + Dnc(^) + 1- 
The decompression is done by measuring qubits in the standard basis one by one, until the end of the codeword is 
reached. Note that the decompressor knows this end point since the code for / is instantaneous. Learning it then 
transfers the contents of the next [log2 dim Tij'-*] qubits into 7ij'\ and prepare 1-6^ in the known state . When the 
input to the compressor was pi, the conditional probability of I is p^'^'-*. Hence the whole process faithfully reproduces 
Pi [see Eq. (^)]. We thus obtain an upper bound lc{E) + D^c{E) + 2 for L^^^^Sf.). Together with Eqs. ( pT| ) and (p^), 
we have 



Ic{£) + D^c{£) < imi„(^) < Ic{S) + Dnc{£) + 2. (13) 

In order to extend this result to the case of n-block coding, we need to know the behavior of the functions Ic{£) 
and Z?Nc(^) for the concatenation of independently drawn letters (states). Suppose that Ha is prepared by a source 
£a ~ {pf , }, and let Ha — ®; "^aj ^ak be the decomposition determined by Sa- Suppose further that another 
system Hb is independently prepared by a source £b = {p^, /5^}, and let Hb = 0„ '^Bi ®^bk^ be the decomposition 
determined by Eb- This preparation can be also considered as the combined system Hab = Ha ® Hb being prepared 
by ^AB = {Pi'vf T pt ■ It was shown [ p^ that the decomposition determined by ^ab is simply given by the direct 

product Hab = (^aj ^b?) (^ak ^ ^bk )• This imphes that Ic{E) and D^g{£) are additive, namely, 

^c(^ab) = Ic{£a) + -^c(^b) and £)nc('^^ab) = ^nc('^^a) + D^ci^B)- Just as in the classical case, this additivity leads 
to our main results, 

Ic{E) + D^c{£) < Rill{£) < Ic{£) + D^c{£) + 2/n, (14) 

and 

Ri'^,H£)^Ic{£) + DME). (15) 

This optimal compression rate R'^^^{£) for the VLF scenario is generally larger than the one in the FLAF scenario, 
which was derived in |^ for general mixed-state cases as 
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Rt^,{£)=Ic{£) + IMS), (16) 

where /nc(^) = J2iP^''^ ^ip'j'')^ S'^'i ^c(^) — — J2iP^'''^ logaP^'^ is the same function as defined above. The difference 
Af-af = ^opt - -f^opt IS 

Af-af = DM£) - Inc{£) = 5I^^*'^[log2 dim H^p ~ S{pP)], (17) 

I 

which is nonzero when there exists I such that pj' ^ (dim 7ij^)^^ij^ The gap Af-af is zero when = 
(dim 7ij'')~^ij-' for aU /. In particular, for the classical cases in which all pi commutes, (dim Tij"*) = 1 for all I 
ll and Af-af = 0. 

The gap Af-af stems from the internal state of each TC^ , which is regarded as the genuinely quantum (nonclassical) 
part of the ensemble £ in the following sense: (i) This part is inaccessible without introducing disturbance [Eq. (|^)] 
and (ii) this part can be compressed only into qubits, not into classical bits Then, the present results are 

summarized in a simple statement, 'the genuinely quantum part of the information is incompressible if no errors are 
allowed.' In contrast, the classical part of the information, which is represented by {p*-*''-*}, can be compressed into 
Ic bits in either one of the scenarios. The difference between the classical and the quantum part in the nature of 
compressibility may be understood as follows. The classical information can be effectively compressed by changing 
the description length adaptively depending on the input states, namely, using shorter descriptions for frequent inputs 
and longer descriptions for rare inputs. The quantum information, if it is to be compressed faithfully, does not allow 
such adaptation because learning the input state inevitably introduces irreversible disturbance. 

It may be quite instructive to contrast the derived gap Af-af with the information defect, the gap between the 
blind and the visible scenario. In the visible scenario, the classical index i is given to the compressor, rather than the 
state Pi to be reproduced in the decompression. The optimal compression rate Jeff for asymptotically faithful schemes 
in the visible scenario is called the effective information. By definition, R^p^ > /cff , and the gap Ab-v = ^^j^t — -^off > 
is called the information defect. While the explicit form of the effective information is still open, it is bounded from 
below by the Levitin-Holevo function [|l^ /lh = S{p) —Y^iPi^iPi)' namely, /eg > /lh An upper bound for Ab-v 
is hence R^pt — /lh = + ^nc — ^lh, and using Eqs. (^) and (|^), we obtain 

Ab-v < - T^P^'^^iPK) = T.P^S{pf), (18) 

i I i 

where the state = 0^ the one obtained from pi by removing the redundant part p^ [see Eq.(^]. Let 

us call an ensemble £ 'pure' when all the states pf^ are pure states, and call it 'mixed' otherwise. Then, Eq. ( [T^ ) 
implies that Ab-v = for pure ensembles. It should be noted that the pure/mixed classification is independent of the 
one by classical (all pi commutes) and quantum. It has been shown ||5|,|Tot that there exist classical ensembles with 
Ab-v > 0. The properties of the two gaps Af-af and Ab-v are summarized in Table |. When the letter states are 
distinct, namely, {pi} are all orthogonal, the optimal compression rates for the three scenarios are equal. When the 
letter states overlap with each other and are not completely distinguishable, there can be two types of overlaps. One 
is the cases when each letter state is noisy. In such cases, whether or not the identity of each letter is given as side 
information affect the compressibility. The other way of overlapping is quantum one, where the letter states are pure 
and suffers no noises, but are not completely distinguishable from each other due to nonorthogonality. In such cases, 
the compressibility depends on whether small errors (even an asymptotically vanishing one) are allowed or not. 

This work was supported by a Grant-in- Aid for Encouragement of Young Scientists (Grant No. 12740243) and a 
Grant-in- Aid for Scientific Research (B) (Grant No. 12440111) by Japan Society for the Promotion of Science. 



Information 


classical 
{[pi,pj] = 0) 


quantum 


pure 
iSipf) = 0) 
mixed 


n.(oo) _ rAF _ r 
-f^opt — -rt-opt — Jeff 

d(°°) _ dAF ^ jr 

-n-opt — -"opt ^ Jeff 


J*opt ^ Jl-opt — Jeff 
^o^' > ^op^ > /off 



TABLE I. Nature of information and gaps between optimal compression rates for various scenarios — the variable-length, 
faithful and blind scenario (R^^), the fixed-length, asymptotically faithful and blind scenario (R^pt), and the fixed- length, 
asymptotically faithful and visible scenario {led)- 
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